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ABSTRACT 

Data from Online Social Networks (OSNs) are providing 
analysts with an unprecedented access to public opinion on 
elections, news, movies etc. However, caution must be taken 
to determine whether and how much of the opinion extracted 
from OSN user data is indeed reflective of the opinion of the 
larger online population. In this work we study this issue 
in the context of movie reviews on Twitter and compare the 
opinion of Twitter users with that of the online population of 
IMDb and Rotten Tomatoes. We introduce new metrics to 
show that the Twitter users can be characteristically differ- 
ent from general users, both in their rating and their relative 
preference for Oscar- nominated and non-nominated movies. 
Additionally, we investigate whether such data can truly pre- 
dict a movie's box-office success. 

Categories and Subject Descriptors 

H.1.2 [Information Systems]: User/Machine Systems- 
Human factors 

General Terms 

Measurements, Online Social Networks 

Keywords 

Information Dissemination, Movie Ratings, Psychology 

1. INTRODUCTION 

Online social networks (OSN) provide a rich repos- 
itory of public opinion that are being used to analyze 
trends and predict outcomes. But such practices have 
been criticized as it is unclear whether polling based on 
OSNs data can be extrapolated to the general popula- 
tion [2]. Motivated by this need to evaluate the "rep- 
resentativeness" of OSN data, we report on a study of 
movie reviews in Twitter and compare them with other 
online rating sites (e.g., IMDb and Rotten Tomatoes) 
by introducing new metrics on inferrability (Z), posi- 
tiveness {V), bias (B), and hype-approval (T-L). 

Although the context of this work is movie reviews 
in Twitter, its scope extends to other product cate- 
gories and social networks. Twitter is our choice for 



this study because marketers consider brand interac- 
tion and information dissemination as a major aspect 
of Twitter. The focus on movies in this paper is also 
driven by two key factors: 

(a) Right in the Level of Interest: Movies tend to 
generate a high interest among Twitter users as well as 
in other online user population (e.g., IMDb). 

(b) Right in Timing: We collected Twitter data dur- 
ing Academy Award season (Oscars) to obtain a unique 
dataset to analyze characteristic differences between Twit- 
ter and IMDb or Rotten Tomatoes users in their reviews 
of Oscar-nominated versus non-nominated movies. 

We collected data from Twitter between February- 
March 2012 and manually labeled lOK tweets as train- 
ing data for a set of classifiers based on SVM. We focus 
on the following questions to investigate whether Twit- 
ter data is sufficiently representative and indicative of 
future outcomes: 

• Are there more positive or negative reviews about 
movies on Twitter? 

• Do users tweet before or after watching a movie? 

• How does the proportion of positive to negative 
reviews on Twitter compare to those from other movie 
rating sites (e.g.. Rotten Tomatoes, IMDb)? 

• Do the opinions of Twitter users about the Oscar- 
nominated and non-nominated movies differ quantita- 
tively from these other rating sites? 

• Does greater hype and positive reviews on Twitter 
directly translate to a higher rating for the movie in 
other rating sites? 

• How well do reviews on Twitter and other online 
rating sites correspond to box-ofhce gains or losses? 

The paper is organized as follows: Section [2] reviews 
related work. Section[3]discusses the data collection and 
classification techniques used. The results are reported 
in HI followed by conclusions in Section [5] 

2. RELATED WORK 

This work complements earlier works in three related 
topics: (a) OSNs as a medium of information dissemi- 
nation, (b) sentiments analysis, and (c) Twitter's role 
in predicting movies box-office. 



Network Influence. Several works have reported 
on how OSN users promote viral information dissemina- 
tion 1 10, and create powerful electronic "word-of-mouth" 
(WoM) effects (?] through tweets, (gj [l2] study these 
tweets to identify social interaction patterns, user be- 
havior, and network growth. Instead, we focus on the 
sentiment expressed in these tweets on popular new 
movies and their ratings. 

Sentiment Analysis & Box-office Forecasting. 
Researchers have mined Twitter dataset to analyze pub- 
lic reaction to various events, from election debate per- 
formance p\ to movie box-office predictions on the re- 
lease day [l]. In contrast, we improve on the training 
and classification techniques, and specifically focus on 
developing new metrics to ascertain whether opinions 
of Twitter users are sufficiently representative of the 
general online population of sites like IMDb and Rot- 
ten Tomatoes. Additionally, we also revisit the issue of 
how well factors like hype and satisfaction reported in 
the user tweets can be translated to online ratings and 
eventual box-office sales. 

3. METHODOLOGY 

3.1 Data Collection 

From February 2 to March 12, we collected a set of 12 
million tweets (world-wide) using the Twitter Stream- 
ing APQ The tweets were collected by tracking key- 
words in the titles of 34 movies, which were either re- 
cently released (January earliest) or nominated for the 
Academy Awards 2012 ("Oscars"). The details are listed 
in Table H 

There were two limitations with the API. Firstly, the 
server imposes a rate limit and discards tweets when the 
limit is reached, but fortunately, the number of dropped 
tweets accounts for only less than 0.04% of all tweets, 
and rate limiting was observed only during the night 
of the Oscars award ceremony. The second problem is 
the API does not support exact keyword phrase match- 
ing. As a result we received many spurious tweets with 
keywords in the wrong order, e.g., tracking "the grey" 
returns the tweet "a cat in the grey box". To account for 
variations in spacing and punctuation, we used regular 
expressions to filter for movie titles, and after that we 
obtained a dataset of 1.77 million tweets. 

On March 12, we also collected data from IMDb and 
Rotten Tomatoes for box office figures and the propor- 
tion of positive user reviews per movie. 

Deflnition of a Positive Review. In contrast to 
our classification of user tweets with a movie review as 
being positive or negative, the users of the above two 
sites attach a numerical rating to each movie. There- 
fore, we use Rotten Tomatoes' definition of a "positive" 
rating as a binary classifier to convert the movie scores 

https: //dev. twitter . com/docs/streaming-api 



ID 


Movie Titk 


' 




Categorj 


3] 


1 


The Grey 


I 


2 


Underworld: Awakening 


I 


3 


Red Tails 


I 


4 


Man on a Ledge 


I 


5 


Extremely Loud & Incredibly Close 


I 


6 


Contraband 


I 


7 


The Descendants 


I 


8 


Haywire 


I 


9 


The Woman in Black 


I 


10 


Chronicle 


I 


11 


Big Miracle 


I 


12 


The Innkeepers 


I 


13 


Kill List 


I 


14 


W.E. 


I 


15 


The Iron Lady 


II 


16 


The Artist 


II 


17 


The Help 


II 


18 


Hugo 


II 


19 


Midnight in Paris 


II 


20 


Moneyball 


II 


21 


The Tree of Life 


II 


22 


War Horse 


II 


23 


A Cat in Paris 


II 


24 


Chico & Rita 


II 


25 


Kung Fu Panda 2 


II 


26 


Puss in Boots 


II 


27 


Rango 


II 


28 


The Vow 


HI 


29 


Safe House 


HI 


30 


Journey 2: The Mysterious Island 


HI 


31 


Star Wars I: The Phantom Menace 


HI 


32 


Ghost Rider: Spirit of Vengeance 


HI 


33 


This Means War 


III 


34 


The Secret World of Arrietty 


III 



Table 1: List of movies tracked (Ref. footnotes 2,3) 

for comparison with the data from Twitter. Rotten 
Tomatoes defines a review being positive when its nu- 
merical rating is 3.5/5 or above, and the site also pro- 
vides the proportion of positive user reviews. Then for 
IMDb, we can do a simple scaling to give a compati- 
ble definition of a positive review as one with a rating 
of 7/10 or above. The proportion of positive user re- 
views in IMDb is calculated over the per-movie rating 
distributions provided in their website. 

3.2 Tweet Training & Classification 

We classify tweets by relevance, sentiment and tem- 
poral context as defined in Table [2] 

We highlight several design challenges before describ- 
ing the implementation. Some of the movies we tracked 
have terse titles with common words ( The Help, The 
Grey), and as a result many tweets are irrelevant even 
though they contain the titles, e.g., "thanks for the 
help" is a valid tweet. Another difficulty is the large 
number of non-English tweets. Presently we treat them 
as irrelevant, but we intend to include them in future 
work. Lastly, both movie reviews and online social me- 

^Bold indicates the movie was nominated for the Academy 
Awards for Best Picture or Best Animated Feature Film. 
^Trending category: (I) trending as of Feb 2; (II) trending 
as of Feb 7 after Oscars nomination; (III) trending as of Feb 
15 after Valentine's Day. 



Class 


Definition 


Example 


Relevance 

Irrelevant (I) 
Relevant (R) 


Non-English (possibly relevant), or irrelevant from the context 
Otherwise 


"thanks for the help" 
"watched The Help" 


Sentiment 

Negative (N) 
Positive (P) 
Mention (M) 


Contains any negative comment 
Unanimously and unambiguously positive 
Otherwise 


"liked the movie, but don't like how it ended" 
"the movie was awesome!" 
"the movie was about wolves" 


Temporal Context 

After (A) 
Before (B) 
Current (C) 
Don't know (D) 


After watching as inferred from context 

Before watching movie 

Tweeted when person was already inside the cinema 

Otherwise 


"had a good time watching the movie" 
"can't wait to see the movie!" 
"at cinema about to watch the movie" 
"have you seen the movie?" 



Table 2: Definition of tweet classes. 



dia have their specific vocabulary, e.g., "sick" being used 
to describe a movie in a positive sense, and this can 
make lexicon-based approaches common in the senti- 
ment analysis literature 



13 unsuitable. 



To filter irrelevant and non-English tweets while ac- 
counting for Twitter-movie-specific language, we decided 
to take a supervised machine learning approach for tweet 
classification, i.e., learn by example. In particular, for 
each of three meta-classes we train one classifier based 
on Support Vector Machines (SVMs). 

Preprocessing. For each line of text, we remove 
usernames, and convert (1) question and exclamation 
marks, (2) emoticons, (3) URLs (hinting the tweet car- 
ries no sentiment) and (4) isolated @ signs (often used 
to indicate presence at a physical location) to their cor- 
responding meta-words for the next step. 

Feature Vector Conversion. Using the MALLET 
toolkit 111], a line of preprocessed text is converted to 
a binary feature vector, such that an element is 1 if and 
only if the corresponding word or meta-word from the 
previous step exists in the text. Stopwords, e.g., "the", 
are not removed as opposed to usual practice because 
some of them are common in movie titles. 

Training and Classification. We randomly sam- 
pled 10,975 tweets and labeled them according to the 
classification in Table [2] Then we implemented and 
trained the three classifiers with SVM'*^''' [s] and its 
multiclass variant [4]. Finally, we use them to classify 
the remaining 1.7 million unlabeled tweets. 

We compare our classifiers with three baseline clas- 
sifiers: a random one that assigns a class uniformly at 
random, a majority one that assigns to each tweet the 
most represented class in the training set, and the Naive 
Bayes classifier implemented in MALLET. Evaluation is 
done using 10-fold cross validation using the accuracy 
rate, i.e., the ratio of the number of correctly classified 
tweets to the total number. The results in Table |3] in- 
dicate that our SVM-based classifiers outperform the 
baselines by a significant margin. 

Although evaluation results can not be compared across 
papers because different datasets, we note that the 98% 
sentiment classifier in [l] was tested on tweets that have 
distinctive movie titles (avoiding the movie 2012) and 





Relevance 


Sentiment 


Timing 


Random 


0.5 


0.33 


0.25 


Majority 


0.52 


0.55 


0.34 


Naive Bayes 


0.89 


0.74 


0.73 


SVM 


0.93 


0.78 


0.78 



Table 3: Comparison of tweet classifiers 





N 


P 


M 


A 


0.045 


0.13 


0.12 


B 


0.011 


0.17 


0.17 


C 


0.0019 


0.019 


0.090 


D 


0.0097 


0.034 


0.20 



Table 4: Fraction of tweets in joint- classes. 

are relatively easy to classify (unanimously voted by 
three MTurk workers to be in the same class). 

4. DATA ANALYSIS 

In this section, we analyze the Twitter user data to 
characterize whether they are sufficiently representative 
of the general online population. In particular, we com- 
pare the proportion of positive and negative tweets to 
the ratings about movies from other online rating sites, 
e.g., Rotten Tomatoes and IMDb. We introduce met- 
rics to quantitatively characteristic how different Twit- 
ter user reviews were from these other sites, and analyze 
the relationship to box-office sales. 

4.1 Movie Review Statistics 

Out of the 1.77M tweets, 51% of them are classified 
as irrelevant, and we focus on the remaining 49% in 
the remaining of this paper. We use the tweet clas- 
sification of Table [2] to infer the temporal context of 
user tweets. Figure |l(a) shows that a large propor- 



tion of tweets about popular movies are made before 
watching the movie, e.g.. The Women In Black (9), 
Chronicle (10), The Vow (28), etc. Moreover, as shown 



in Figure 1(b) most tweets are helpful in publicizing 



the movies (i.e., WoM) as they often mention screening 
venues and contain positive opinions. Table |4] shows 
the joint tweet distribution by sentiment and temporal 
context. If a person tweets before or after watching, the 
tweet is likely positive. Tweets sent current to watch- 
ing are mostly neutral "check-in's" using location-based 
social networking services. 






(a) Tweeting Time 



(b) Tweet Content 



(c) Positive-Negative Ratio 



Figure 1: Count of tweets |(a)| by temporal context, (b) by sentiment, and (c) P/N ratio of movies 



Analyzing the impact of these positive and negative 
online reviews is an important topic for both network- 
ing and marketing communities. Product ratings on 
sites like Amazon typically have a large number of very 
high and very low scores, which create J-shaped his- 
tograms over the rating scale [6]. This is attributed 
to the "brag-and-moan" phenomenon among reviewers. 
But researchers have also suggested that due to risk- 
averseness among consumers, negative reviews tend to 
have a higher impact than positive reviews. However, 
the impact of these negative reviews can be greatly di- 
minished if they are vastly outnumbered by positive re- 
views. Hence, it is important to examine whether posi- 
tive reviews dominate in proportion to negative reviews 
on OSNs like Twitter. 



Figure 1(c) shows that the number of positive re- 
views on Twitter indeed exceeds the number of nega- 
tive reviews by a large margin for almost all the movies 
trackecQ Such a large positive bias may be due to the 
psychology of sharing positive and helpful image among 
followers. This observation holds some promising im- 
plication in developing general marketing strategies for 
sellers and distributors. For example, instead of focus- 
ing on reducing the negative reviews from a few dissat- 
isfied customers, it may be better to focus on enhancing 
the already high proportion of positive reviews on OSNs 
and use virality effects to influence consumers. 

4.2 Movie Preferences of Twitter Users 

In this section we compare the proportions of positive 
to negative user reviews in Twitter, IMDb and Rotten 
Tomatoes. In Twitter, a positive review is a tweet in 
class AP, and a negative review is a tweet in class ^A^. 
Thus the proportion of positive to negative reviews in 
Twitter is the ratio ^p.^jy ■ ^^^ stringent definition of 
a tweet being positive, i.e., not containing any negative 
comment, makes the ratio an underestimate of the ac- 
tual proportion, and as we will see, can only strengthen 



*The only exceptions were Haywire and Star Wars I: The 
Phantom Menace 3D, for which P/N < 1 as the audience 
and critic responses were uniformly negative. 



our results. We also contrast our definition to existing 
work on sentiment analysis, which can only identify the 
ratio p^^ and is likely to overestimate the proportion 
of positive reviews because of the dominance of positive 
tweets. 

Qualitative Results. Figures 2(a) to |2(d)| show 
the scatter plots of the proportions of positive reviews 
across Twitter, IMDb and Rotten Tomatoes. The dot- 
ted lines in the plots makes an angle 7r/4 (in radians) 
with the X-axis and indicate the location the propor- 
tion being the same in Twitter or IMDb/Rotten Toma- 
toes, i.e., if a datapoint is above the line, then users in 
IMDb/Rotten Tomatoes are more positive than those 
in Twitter on a certain movie, and vice versa. For new 
movies. Figures 2(a) and |2(b)| show that most of the dat- 
apoints are below the dotted lines, which means users in 
Twitter are in general more positive towards the movies 
considereqfl 

Quantitative Results. Here we introduce a set of 
three metrics {V , B,I) to quantitatively summarize the 
discrepancy across two sets of positive review propor- 
tions. Let n be the number of movies considered, Xi be 
the positive proportion for the i-th movie in Twitter, 
and j/i be that in IMDb or Rotten Tomatoes. The met- 
rics V G [0, 1] and B G [—1,1] are defined using the me- 
dian proportion (x* , y*), where x* = median{a;i, . . . , a;„} 
and y* = median{?/i, . . . , y„}. Then we have 



V 



r 



6=1- tan" 



vr 
4' 



V is the Positiveness of the combined population 
of Twitter and IMDb/Rotten Tomatoes users in terms 
of the median (x*,y*). B is the Bias in positiveness 
of Twitter users over IMDb/Rotten Tomatoes as the 
distance between the median and the 7r/4 line. 

The metric I applies the notion of mutual informa- 
tion from information theory (3J. Let the interval [0, 1] 
be divided into m subintervals: 6i = [0,ai], 62 = (01,02], 



'Recall the ratio 



AP 



is an underestimate of the actual 



AP+AN 

proportion for Twitter, so the datapoints should be even 
further below the dotted lines, and our results still hold. 



28*^ 4- 



AP/{AP+AN) Ratio 

(a) IMDb vs Twitter re- 
views (newly released) 



1 i!9 3 



AP/{AP+AN) Ratio 

(b) Rott. Tom. vs Twitter 
reviews (newly released) 



o 0.9 

I 

> 0.8 

% 0.7 

° 0.6 



7 * 
2^2 



AP/(AP+AN) Ratio 



(c) IMDb vs Twitter reviews 
(Oscars) 



AP/(AP+AN) Ratio 

(d) Rotten Tomatoes vs 
Twitter reviews (Oscars) 



Figure 2: (a) and (b) show that the new movies score more positively from Twitter users than the 
general population of IMDb and Rotten Tomatoes, and (c) and (d) show that the oscar-nominated 
movies generally score more positively in IMDb and Rotten Tomatoes than from Twitter users 

release "hype" on Twitter, measured by the number of 
tweets about a movie before its release, to estimate the 
opening day box-office [I]. We extend this notion of 
hype to a more generic metric of hype- approval factor 
to study how well such pre- and post- release hype on 
Twitter correspond to a movie's eventual ratings from 
the general population on IMDb and Rotten Tomatoes. 
Given our ability to classify positive tweets into those 
that were made before watching (i.e., in hype) and after 
watching (i.e., in approval) a movie, we can measure 
their ratio as the hype-approval factor, %: 



Comparison 


V 


B 


X 


Twitter-RT Oscars 
Twitter-IMDb Oscars 


0.79 
0.79 


-0.024 
-0.024 


0.56 
0.42 


Twitter-RT Newly Released 
Twitter-IMDb Newly Released 


0.69 
0.73 


0.13 
0.064 


0.67 
0.52 



Table 5: Summary metrics. 



T bm = {o-m , 1] • Then I is defined as 



mm / ■ ■\ 

^-^ ^-^ PxWPy{j) 



i=i j=i 

where pxii) = ^{{xk-.yk) 

PyU) = #{{xk,yk) 

PxviiJ) = #{{xk,yk) 



Xk e bi}/n 
Vk e bj}/n 
Xk € bi,yk G bj}/n. 



n 



BP # Positive tweets before watching 
AP # Positive tweets after watching 



Intuitively, I quantifies the Inferrability across dif- 
ferent sets of reviews, i.e., if one knows the average rat- 
ing for a movie in Twitter, how accurately he/she can 
use it to predict the average rating on IMDb. This is 
intrinsically related to the spread of datapoints in the 
scatter plots. For example, if there are many movies 
with Xi in some small range but at the same time they 
have very different j/^, knowing a movie to have Xi in 
that range does not help much in predicting its yi value. 

We compute the three metrics for the four pairs of 
ratings with results shown in Table IS] (I is computed 
by dividing [0, 1] into ten equal-sized subintervals). The 
metrics capture what we can observe from the scatter 
plots more concisely: (1) Oscars-nominated movies have 
higher overall ratings (higher V), and (2) Twitter users 
are more positive towards newly released movies {B > 
0). Not obvious from the plots is the ratings in Rotten 
Tomates are closer to Twitter by a higher I. 

4.3 Can Twitter Hype predict Movie Ratings? 

IMDb and Rotten Tomatoes' user ratingf]^ are often 
used as a predictors of a movie's quality and box-office 
potential. With the ready availability of OSN user opin- 
ion as poll data, researchers have proposed using pre- 

®For a fair comparison, we exclude scores from movie critics. 



Using tweets collected over a period of time (e.g., a 
month), if the ratio of ^ w 1, then it indicates that 
the movie lived well up to its hype. A ratio less than 1 
indicates that a movie generated much less hype than 
its post-release audience approval, while a ratio greater 
than 1 is indicative of a high hype that may be further 
heightened by audience approval over time. 

Figure 3(a) (Figure 3(b)| p shows the relationship be- 
tween the fraction of positive ratings for different movies 
from IMDb and Rotten Tomatoes users versus their 
Twitter hype-approval factor, H (hype count, BP). From 
these plots, we see that for either metric, there are sev- 
eral movies with low ^^ (and low BP) that get very 
high scores in both IMDb and Rotten tomatoes (e.g.. 
Chronicle (10), The Secret World of Arrietty (34)). On 
the other hand, some movies that enjoy a higher ^^ 
(and/or high BP) in Twitter can get lower ratings from 
the general population (e.g., The Vow (28)). 

to 



This reaffirms the observation from Figures 2(a) 



2(d) that we need to be cautious in drawing conclusions 
about a movie's success from observed Twitter trends. 
Even accounting for the hype and the approval level in 
Twitter may be insufficient to predict a movie's rating 
from the general online population. 



^In order to have sufficient datapoints across all newly re- 
leased movies, for these two figures and Figure |4] we track 
tweets from the week after release. 




(a) Online Ratings vs. "H (b) Online Ratings vs. BP 

Figure 3: Fraction of positive ratings from IMDb 
and Rotten Tomato versus (a) hype-approval 



factor, Ti, and (b) hype, BP, in Twitter 

4.4 Box-office Gains: Twitter Hype-satisfaction 
or IMDb Ratings? 

Earlier works fll have reported that a higher number 
of positive tweets or "hype" about a movie on Twitter 
directly translates into a higher box-ofhee sales on the 
opening day. However, whether a good box-ofhce sale is 
sustained or not also depends on the amount of positive 
tweets made by satisfied Twitterers after watching the 
movie (i.e., AP tweets), which in turn can induce more 
hype (i.e., BP tweets). We show in Figure 3(a) that a 



high (low) ^ ratio does not necessarily correspond to 
a high (low) rating for a movie in the other sites, and 
hence, it is of interest to explore whether such scores are 
any good indicators of a movie's eventual box-ofhce. 

Figure H] shows the classification of the movies listed 
in Table iTby their Twitter's ^p ratio in the first level, 
by their IMDb scores in the second level, and finally by 
their box-office figures from IMDb. A box-office earning 
of $50 million is taken as a standard valuation for fi- 
nancial success, although the key observations reported 
below will hold for any amount between $20 million to 
$60 million box-office for the given list of movies. The 
figure highlights a few interesting outcomes: 

(a) Even if a movie has ^ < 1 (low hype-approval) 
and IMDb rating < 0.7 (low-score) , it can still become 
financially successful (e.g., Journey 2 (30)). 

(b) Movies that have ^p < 1 (low hype-approval) 
but IMDb rating > 0.7 (high-score), or ^ > 1 (high 
hype- approval) but IMDb rating < 0.7 (low-score), can 
be financially either successful or unsuccessful. 

(c) None of the movies with ^ > 1 and IMDb 
rating > 0.7 have a box-office success of less than $50M. 

In other words, a high score on IMDb, complemented 
with a high hype-approval factor in Twitter, can be in- 
dicative of financial success, but otherwise marketers 
need to be careful about drawing conclusions regarding 
the net box-office outcome for a movie. 

5. CONCLUSIONS 

This paper presents a study that compares data from 
Twitter to other online user population. We show that 




Box Office > $50M? 



Movie ID 9, 29 



28 11, 1, 10 4, 24 30 13, 

12, 32 33, 34 14, 31 



Figure 4: Relationship between IMDb ratings. 
Twitter review scores, and Box-office outcomes 

Twitter users are more positive in their reviews across 
most movies in comparison to other rating sites, like 
IMDb and Rotten Tomatoes. Moreover, compared to 
IMDb and Rotten Tomatoes users, the computed scores 
from Twitter users are slightly less positive for the Oscar- 
nominated best films but more positive for non-nominated 
films, which we quantify by introducing a few inter- 
esting information-theory inspired metrics V, B, and 
I, which together capture the "bias" observed among 
Twitter users. We also introduce a hype-approval met- 
ric "H, measured as a ratio of the total number of posi- 
tive tweets the users make before and after watching a 
movie, and relate it with the ratings for the movie on 
IMDb or Rotten Tomatoes. Finally, we show that scores 
computed from Twitter reviews and other online sites 
do not necessarily translate into predictable box-office. 
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